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Abstract 

This paper considers fully dynamic graph algorithms with both faster worst case update 
time and sublinear space. The fully dynamic graph connectivity problem is the following: given 
a graph on a fixed set of n nodes, process an online sequence of edge insertions, edge deletions, 
and queries of the form “Is there a path between nodes a and 6?” In 2013, the first data structure 
was presented with worst case time per operation which was polylogarithmic in n. In this paper, 
we shave off a factor of logn from that time, to 0{log‘^n) per update. For sequences which are 
polynomial in length, our algorithm answers queries in 0(logn/log logn) time correctly with 
high probability and using 0(nlog“ n) words (of size logn). This matches the amount of space 
used by the most space-efficient graph connectivity streaming algorithm. We also show that 
2-edge connectivity can be maintained using 0(n log^ n) words with an amortized update time 
of 0(log® n). 
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1 Introduction 


In the dynamic connectivity problem, we are given an undirected graph G = {V, E), where 1/ is a 
fixed set of n nodes, and an online sequence of updates and queries of the following form; 

• Delete(e): Delete edge e from E. 

• Insert(e): Insert edge e into E. 

• Query(x,y): Is there a path between nodes x and y? 

The goal is to process graph updates and queries as efficiently as possible. Our data structure 
solves the dynamic connectivity problem using 0(log'^) worst case time per update operation and 
0(logn/loglogn) worst case time per query. In addition, if the sequence of updates is polynomial 
in length, our data structure needs only a sublinear amount of storage, namely O(nlog^n) words 
(of size O(logn)), to ensure that with high probability, all queries are answered correctly. By way 
of comparison, the dynamic graph stream problem is to process an online sequence of polynomial 
number of edge insertions and deletions so that when the stream ends, a spanning forest is output, 
where the goal is to minimize space. Our algorithm is the first sublinear space dynamic graph 
algorithm: it matches the space needed by the most space efficient dynamic graph stream algorithm 
by Ahn, Guha, and McGregor from 2012 [T], and it shaves a factor of O(logn) off the worst case 
time per update of the best previously known dynamic connectivity algorithm by Kapron, King, 
and Mountjoy from 2013 [6]. 

As an application, we show the first sublinear space dynamic 2-edge connectivity algorithm, 
which uses 0(log® n) amortized update time, 0(logn/loglogn) query time, and 0(nlog^ n) words. 
This compares to the 0{m + nlogn) word algorithm with amortized update time O(log^n) and 
0(logn/loglogn) query time by Holm, de Lichtenberg, and Thorup from 1998 [5]. 

The sublinear space algorithm has 2-sided error of 1/n'^ for any constant c. If we keep a list 
of edges with additional space 0{m) then there is 1-sided error: if the answer to a query is “yes,” 
then the answer is always correct; if it is “no,” then it is correct with probability 1 — l/n'^ for c any 
constant. In both cases, we assume the adversary knows the edges in the graph, but constructs 
the sequence of updates and queries with no knowledge of the random bits used by the algorithm, 
other than whether the queries have been answered correctly or not. The edges of the spanning 
forest used by the data structure are not revealed to the adversary. 

Several dynamic graph problems can easily be reduced to the problem of maintaining connec¬ 
tivity. See [Ij. Thus our algorithm yields a dynamic graph algorithm for determining the weight 
of a minimum spanning tree in a graph with k different edge weights, where the cost of updates is 
increased by a factor of k. It also can be used to maintain the property of bipartiteness with only 
constant factor increase in cost. See mm- 

An Overview of Teehniques: Nearly every dynamic connectivity data structure maintains a spanning 
forest F. Dealing with edge insertions is relatively easy. The challenge is to find a replacement 
edge when a tree edge is deleted, splitting a tree into two subtrees. A replacement edge is an edge 
reconnecting the two subtrees, or, in other words, in the eutset of the cut (T, V\T) where T is one 
of the subtrees. To simplify notation, where there is no confusion, we allow T to denote the set of 
nodes in the tree T. Otherwise we will use the notation v{T). 

We compare the techniques of our paper to the 2013 dynamic graph connectivity [6] and the 
2012 dynamic graph stream paper [T]. All three papers use random sampling to identify the 
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name of a edge which crosses a cut. The 2013 dynamic graph paper introduced the cutset data 
structure. It maintains a dynamic forest of disjoint trees which is a subgraph of a graph G and, for 
each tree, a “leaving” edge, i.e., an edge with exactly one endpoint in the tree. That paper uses 
0(m log n + nlog^ re) words to implement the cutset data structure and ensures that edges leaving 
are identified with high probability. For each edge inserted, it flips fresh coins to determine which 
samples it appears in and records these outcomes in a table. 

The graph stream paper implicitly uses a restricted cutset data structure which accommodates a 
continuous stream of nontree edge insertions and deletions, with one final batch change of nontree 
edges to tree edges, at which point the leaving edge from a tree is determined with constant 
probability. The streaming paper uses O(relogre) words to implement this restrictive cutset data 
structure and O(relog^re) words overall. It uses hash functions rather than independent random 
bits to determine whether an edge is in a sample, eliminating the need for a table to remember the 
sampling outcomes. It uses a well-studied technique known as Lq sampling and 1-sparse recovery 
to sample, hnd an edge with constant probability, and then verify with high probability it is an 
edge. 

Our cutset data structure also determines an edge leaving with constant probability using 
O(relogre) words. It can be seen as using Lq sampling and a novel, possibly more practical, method 
of 1-sparse recovery. 

All three papers use O(logre) tiers of cutset data structures to build a spanning forest, similar 
to the Boruvka’s parallel minimum spanning tree algorithm. Each tier contains a forest, and each 
forest is a subforest of the tier above it. In the cutset data structure on tier z, the leaving edges 
discovered there become tree edges in the forest on tier i -|- 1, and so on. The streaming paper 
needs to form these forests only once at the end of the stream. In worst case, a single insertion or 
deletion can cause a superpoly logarithmic number of changes to edges leaving on the various tiers. 
Thus maintaining the spanning forest for each update that the streaming algorithm would have 
produced if the stream had ended at that update, is too costly in terms of time. By using high 
probability to find edges leaving, the 2013 dynamic graph paper avoids this problem by finding an 
edge leaving every non-maximal tree in a tier at an extra cost of log re factor in time and space. 
Our algorithm avoids the cost of using high probability; it does not necessarily keep the same tree 
edges as the streaming algorithm, but it ensures that about the same number of trees are merged 
on the next tier. This induces a subtle change to the invariants, which are less restrictive. 

Related work: For other related work on dynamic connectivity, see [6] 

Organization: In Section 2 we define the cutset data structure. In Section 3, we show how it can 
be used to solve dynamic connectivity. In Section 4, we discuss the implementation of cutset data 
structures. In Section 5, we give a space efficient dynamic 2-edge connectivity algorithm. 

2 Cutset data structure definition and results 

Consider a forest F of disjoint trees which are subtrees of a graph G. G and F are dynamic, that 
is, they are undergoing updates in the form of edge insertions and deletions. In addition, an edge 
in G which links two trees may be updated to become a tree edge which would then cause the 
two trees in F to be joined as one. Similarly a tree edge may be changed to a non-tree edge. The 
problem is to design a data structure with the goal of maintaining, for each tree T in F which is 
non-maximal, an edge in the cutset of (T, F \ T). A successful Search (T) returns this edge if the 
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cutset is non-empty and null otherwise. We assume that the graph updates are oblivious to the 
(randomized) outcomes of Search. I.e., we may consider the sequence of updates to be fixed in 
advance, but updates are revealed to the algorithm one at a time. The operations are: 

• lnsert_edge(a:, y): Adds edge {x,y} to E. 

• Make_tree_edge(x, y): For {x,y} € E but not in F, adds {x,y} to F. 

• Make_nontree_edge(x, y): Removes {x,y} from T. 

• Delete_edge(a:, y): Removes {x,y} from E. If {x,y} is an edge of F, removes it from F. 

• Search(T): For T ^ F, returns null if there are no edges of E in the cutset of (T, R \ T); 
returns an edge in th cutset of (T, V \ T) otherwise. 

Lemma 2.1. Let c > 0 be any constant. There is a cutset data structure for graph G with forest F 
with worst case update time O(log^n) and space 0(n log n) words, such that for any T ^ F, where 
C is the cutset of {T,V \ T), if C then with constant probability p > 1/8 — l/n‘^, Search(T) 
returns an edge in the cutset; with probability less than 1/n^ it returns a non-edge or edge not in 
C; otherwise it returns null. If C = tb, it always returns null. 

With an auxilliary list ofO(m) words to keep track of the current edge set in the graph, Search(r) 
returns an edge in C with probability 1/8 and always returns an edge in C, if it returns an edge. 

The implementation of the cutset data structure is discussed in Section 01 

3 Fully dynamic connectivity algorithm 

Theorem 3.1. For any constant c and any sequence of polynomially many updates, there is a 
dynamic connectivity algorithm which answers all queries correctly with high probability and uses 
O(log^n) time per update and O(nlog^n) words. The error is 2-sided, i.e., ”yes” may be output 
when the correct answer is ”no” and vice versa, with probability less than 1/W. 

If a list of current edges in the graph is maintained (with an additional 0{m) words), any 
sequence of any length of updates may be processed so that with high probability any single query 
is correct. In this case, the error is 1-sided; if an answer to a query is “yes” the answer is always 
correct and if it is “no”, it may be incorrect with probability 1/W. 

3.1 Tiers and Invariants 

Recall Boruvka’s parallel algorithm for building a minimum spanning tree [2]. We may view that 
algorithm as constructing a sequence of forests we call tiers. On tier 0 are the set of nodes in the 
graph; on tier £ -|- 1 is a subforest of the minimum spanning forest formed when each 1—1 tree 
picks a minimum cost edge linking it to another 1—1 tree, 

Our algorithm has similar structure. For each tier i = 0,1,...,top, where top = O(logn), we 
maintain a cutset data structure CT>i for G with a forest Fg and refer to the collection as CT). Each 
cutset data structure in CT) is generated using independent randomness. Because tier top is not 
used to form a forest on a higher tier, the data structure only consists of a forest Ftop, i.e., there is 
no randomness array on this tier. 

For each tier I < top, Search(T, £) refers to the Search(T) operation in the cutset data structure 
CT>i where T G F/; we similarly extend the definitions of other operations of a cutset data structure 
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for each tier 1. Since there is no randomness array on tier top, for Delete_edge(e, top) we remove e 
from Ftop if it appears there but do nothing else. 

As the graph is updated, the cutset data structure for tier i is used to find edges leaving non- 
maximal trees (“fragments”) in that tier. Given an edge e, let i be the minimum tier in which e 
first appears in F^. We then refer to it as a tier i edge. We maintain the following invariants of 
CD. Let G be the current graph. 

1. Fq = V, the vertices of G; 

2. For every tier i, 0 < £ < top — 1, F) C F^+i C G. 

3. Let T be a node in tier £ < top. If Search(r, £) is successful then v{T) is properly contained 
in some v{T') for T' a node on tier £ -\-l. 

Discussion: Note that these invariants do not necessarily preserve the property that every edge in 
the spanning tree Ftop is currently an edge leaving some fragment in some CD. Rather, Invariant 
[3] ensures that any fragment T in tier £ for which Search (T) is successful has some edge leaving it 
in F)+i. Also, the invariants do not preclude there being a tree edge in F)+i which is incident to a 
fragment T in F^ when Search (T) is not successful. The edges in the forests may be thought of as 
“historical” Search edges, edges which were once returned by Search. This gives us the flexibility 
to perform updates quickly. 

Markov’s inequality implies that with constant probability, a constant fraction of executions of 
Search on fragments are successful: 

Lemma 3.2. Let p be defined as in Lemma [2.11 In a cutset data structure with fnum fragments, 
with probability at least a = (1 — p)/(l — p/2) the number of fragments T such that Search(T) 
succeeds is at least {p/2)fnum- 

Proof. If there are fnum fragments in the cutset data structure, then the expected number of 
fragments for which search fails to succeed is no greater than p = (1—p)/ since each has probability 
at least p of finding a tree edge. By Markov’s Inequality, the probability that (1 — p/2) fnum fail to 
find a tree edge is no more than p/(l — p/2) fnum = (1 “P)/(l —p/^)- D 

As any edge can leave at most two node-disjoint fragments, it is easy to see the following: 

Observation 1. Let T be a forest containing / fragments and E' be a set of edges leaving at least 
cf fragments T then the spanning forest of T U S' has no more than (1 — c/2)/ fragments. 

Lemma 3.3. For any graph G and any constant c, let a = |'log 4 p 4 _p) n] and top = max{2a/Q;, 8c In n/a}, 
where a = (1 —p)/(l —p/2). If the invariants hold then with probability 1 — 1/n'’, Ftop is a spanning 
forest of G. 

Proof. Let p be the constant in Lemma 12.11 We call a tier with fnum fragments successful if the 
number of fragments T where Search(T) is successful is at least {p/2)fnum- Let = 1 if tier £ 
is successful and 0 otherwise. Then by Lemma 13.21 Pr{Xi = 1) > a. For any tier £, if there are 
{p/2) fnum fragments with successful searches in a tier £ then by Invariant^ each of these fragments 
is properly contained in F^+i, i.e., the set of tier £ + 1 edges includes an edge leaving for each of 
these. Then by Observation [H the number of fragments in F/+i < (1 — p/^)fnum- 

Since (1 —p/Dfin < 1, a successful tiers suffice to bring the number of fragments to 0 (as there 
can’t be only 1). If we flip max{2a/a, 8cIn n/a} coins with probability of heads a, the expected 
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number of heads at ;U = max{2a, 8 cln n}. The probability that the number of heads is less than 
a = /i /2 is is less than = l/n'^, by a Chernoff bound. □ 

We note that the results for a fixed graph can be extended to hold for poly{n) arbitrary sub¬ 
graphs, including those generated by a sequence of polynomially many updates: 

Corollary 3.4. Let be the graph G after u updates, let d, d be any constants, and assume 
u < n^. If top = max{2a/a, 8 (c' + d)hin/a}, then in the cutset data structures CV'^ for each 
with probability 1 — 1/n^ every Fijfp is a spanning forest of This holds even if there is 
dependence in the randomness between any pair of CD^ ’s. 

Proof. Consider a collection of n'^ graphs G*. Set c = d + d in Lemma 13.31 By a union bound, the 
probability that some fail to be a spanning forest of G* is no greater than n^{\/n^) = 1 /n^ . □ 

3.2 Performing updates 

We say a fragment in a tier I is isolated if it is equal to a fragment in tier I+l. To efficiently maintain 
the invariants as edges are updated, we only check a cutset induced by a tree if the invariants could 
have become violated. The invariants are maintained from the bottom up. Changes to a tree may 
be needed because ( 1 ) a fragment becomes isolated when a tree edge is deleted; ( 2 ) for an isolated 
fragment T, Search (T) may become successful, when before it wasn’t or (3) a fragment which is 
newly formed is isolated. 

Let T{x) denote the tree containing node x in a forest F. If edge {x,y} is inserted, we start at 
the lowest tier, checking on each tier i as follows: if T{x) 7 ^ T{y) this may affect Search(T(x), £) 
and Search(T(//), £). If T{x) is isolated and Search(r(x), £) is successful, the edge returned by T{x) 
is added to T/+i. Once a tree edge is added it must be added in all higher tiers. If this forms a 
cycle, a highest tier edge in the cycle is made into a non-tree edge. This may in turn cause an 
isolated fragment T' on a higher tier F, for which Search(T', £') must be checked. If edge {x,y} is 
deleted, it does not affect the results of a search if both x and y are in the same fragment. If they 
are not, then calls to Search for both x and y are potentially affected. At this point we proceed as 
we do in the case of insertions. 

We now give the algorithms for handling updates and queries. The following procedure, which 
restores the invariants after an update involving an edge {x, y}, as described above, is used for both 
insertions and deletions. 

• Initialize: For each tier i = 0, 1,2, ...,top — 1, initialize a cutset data structure CDi where Fi 
contains n trees consisting of the single vertices in G. 

• Insert(e): For £ < top, call lnsert_edge(e, to insert e into CVi, and call Refresh(e). 

• Delete(e): For £ < top, call Delete_edge(e, f) to delete e from CVe, and call Refresh(e). 

• Query(x,//): If T{x) = T{y) in Ftop, return “yes”, else return “no”. 

A proof of the following is given in the Appendix. 

Lemma 3.5. Invariants (1), (2) and (3) are maintained by Initialize, Insert, and Delete. 
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Algorithm 1 Refresh({x, y}) 

1: for £ = 0,..., top — 1 do 
2: for u G {x, y} do 

3: if T{u) is isolated and Search(T(u), f) returns an edge e„ = {a,b} then 

4: if a path between a and b exists on some tier > i then 

5: Let j be the lowest tier on which there is a path 

6: Let e' be an edge of maximum tier in the path on tier j 

7: Call lnsert_edge(e', /c) for each k> j 

8 : Call Make_tree_edge(eu,f') for each i', top > i' > i 

9: end if 

10: end if 

11: end for 

12: end for 


3.3 Implementation details, running time and space analysis 

We briefly review basic data structures used. ET-trees are based on the observation that when 
a tree is linked with another tree, the sequence given by the Euler tour of the new tree can be 
obtained by a constant number of links and cuts to the sequences of the old trees. The sequence is 
stored in a balanced ordered binary tree. 

ET-tree properties: If each node in the graph has a value stored in x words of length O(logn) then 
the following hold. 

1. A tree edge can be deleted or inserted in O(xlogn) time. 

2. The sum of the values stored in the nodes of a given tree can be returned in 0{x) time. 

3. A value at a node can be updated in O(xlogn) time per word. 

4. Given a node v, the name of the tree T{v) containing v can be returned in O(logn) time. 

5. The name of the tree can be obtained more quickly if one keeps an ET-tree of degree 0(log n) 
solely for this purpose. That is, it can be obtained in O(log n/log log n) if one is willing to 
spend 0 (log^ n) time doing an insertion or deletion of a tree edge. 

ST-trees (aka link-cut trees) provide a means of finding the maximum weight edge on a path 
between two nodes in a dynamic forest, in O(logn) time per tree edge link, cut, and the find 
operation. ST-trees and ET-trees require only 0{n * \value\) space where \value\ is the size of the 
vector or value stored at each node or edge. 

Implementation details: Ftop is stored as a degree-lgn ET-tree where no values are associated with 
a node, so that queries can be answered in time 0(log n/ log log n). Ftop is also stored in an ST-tree 
where each tier £ tree edges has weight £. An ST tree data structure allows us to find the cycle 
edge in O(logn) time. 

In the worst case, a new tree edge is found in every tier twice and these changes are filtered up, 
for a total of 2 top — £ tree edge insertions or deletions: 

Lemma 3.6. Let t{n) be the update time for a cutset data structure and s{n) be the space. Then 
the running time and space for the fully dynamic connectivity algorithm is 0{{top^*t{n)+top*logn) 
and the space needed is 0{top * s(n) -T n). 
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Since top = O(logn), Lemma 13.61 and Lemma l2 .1 1 imply Theorem 13.11 


4 Implementation of the Cutset Data Structure 

To illustrate the main idea of our data structure, we will first explain a simplified deterministic 
version which works when there is exactly one edge in each cutset. 

Let G = {V, E) be the current graph. Each edge {x, y} € G, 0 < x < y < n, has the name 
Xb ■ 7 /fe where Xb and yb are the [Ign] bit vectors containing the binary representations of nodes x 
and y respectively and Xb ■ Pb denotes the 2[lgn] bit vector Xb followed by pb- We say that edge 
{x, y} is given by Xb ■ Pb- Each node will maintain a vector equal to the bitwise XOR (i.e., sum mod 
2 ) of the vectors assigned to each of its incident edges and we use summation notation to denote 
bitwise XOR of two or more vectors. Eor each tree T € E, the ET-tree for T maintains the bitwise 
XOR of the vectors stored at T’s nodes. We use “addition” or “subtraction” of vectors to refer to 
the bitwise XOR of the vectors. The operations are implemented as follows: 

• lnsert_edge(x, y): Add its name Xb ■ Pb to the vectors at nodes x and y. 

• Make_tree_edge(x, y): Insert {x,y} into F (by linking the ET-trees representing T{x) and 
r(y)). 

• Make_nontree_edge(x, y): Delete {x, y} from F by removing it from the ET-tree that contains 
it. 

• Delete_edge(x, y); Subtract vector Xb ■ Pb from the vectors at the nodes x and y. If the edge 
is in F, remove it from F by calling Make_nontree_edge(x, y). 

• Search(r): . Let 2 ; = 212 : 2 ... 2 : 2 [ign] be the sum of the vectors of nodes in T. If 2 7 ^ 0 then 
return 2 . 

Lemma 4.1. For any tree T £ F, if there is exactly one edge in the cutset G of the cut {T,V\T), 
then Search(r) is always successful. The amount of space used is O(nlogn) bits. 

Proof. Every edge with two endpoints in a tree T is added to the vectors of two nodes in T and 
contributes twice to 2 , thus it appears not all in the sum 2 , while every edge with exactly one 
endpoint contributes exactly once to 2 . If the cutset has exactly one edge, 2 is the name of the 
edge. □ 

4.1 Extension to cutsets with more than one edge 

We extend our data structure to handle the case of when the cutset of (T, V \T) has more than 
one edge. We may do this in sublinear space, but then we require the assumption that no edge is 
deleted unless it is currently in the graph and the number of updates is bounded by for some 
constant d. Alternatively, we can keep a list of edges currently in the graph. 

We maintain for each node x € E an array indexed by levels i where 0 < i < [21gn] = 
levelNum. Let Sj(x) denote the vector on level i for node x. When edge {x,y} is inserted into G, 
for each i = 0,1,..., levelNum — 1 with probability 1/2* we add Xb ■ Pb to both Sj(x) and Sj(y). The 
idea is that when the size of the cutset is ~ 2 *, with constant probability there is exactly one edge 
from the cutset and a unique x in T, such that the name of the edge is added to Sj(x). In this case, 
Zi = Yhx&T the name of that one edge and Search(T) is successful. 
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To decide which edges to included in the sums on the various levels without recording each 
coinflip, we use a 2 -wise independent hash function h mapping edges into where for 

k>l, [/c] denotes k}. An edge e is sampled in level i, i = 0, levelNum of the cutset data 

structure iff h{e) < 2 *. 

The following lemma is straightforward and has been observed previously. A proof is in the 
Appendix. 

Lemma 4.2. Let W be the cutset of {T,V \ T). With probability 1/8, there is an integer j such 
that exactly one e €W hashes to a value in [2-^]. 


To verify with high probability that zt is actually the name of one edge in the cut rather than 
the sum of several names, we make use of an odd hash function^ 

We say that a random hash function / : [1, m] —)• {0,1} is e-odd, if for any given non-empty set 
S C [l,m], there are an odd number of elements in S which hash to 1 with probability e, that is, 


Pr 

/ 


-xgS 


mod 2 


> e. 


( 1 ) 


We use the construction of [7j to create a (l/ 8 )-odd hash function. Let m < 2'^, where w = 
2[lgn]. We pick uniformly at random an odd multiplier k from [1,2"'] and a threshold t G [1,2"']. 
From these two components, we define / : [1, 2"'] —> {0,1} as follows: 


f{x) = 1 if {kx mod 2 "') < t and x 7 ^ 0 

= 0 otherwise. 


The idea of the verification is as follows: we use a 2-wise independent function to randomly 
partition a set into two and test each part using using an odd hash function to determine if both 
parts have at least one element. Repeating this 0(log n), if all tests fails, then with high probability 
there cannot be more than one element in a set. 

In particular, letting S = [( 2 )]) the algorithm is initialized by choosing the following functions 
independently at random. 

• hj : S ^ {0,1}, j = 1,2, ...,clgn. Each hj is a 2-wise independent hash function. 

• fj^b ■ S — 7 > {0,1} for j = 1, 2,..., clg n, and 6 = 0,1. Each fj^b is an independently random 
(l/ 8 )-odd hash function. 

The /j,b’s are used to determine a 2clgn-bit tag of an edge at a particular level. For each node 
X, we maintain an auxilliary vector s'^^(x). The tag of an edge is added (by bitwise exclusive-or) 

^ We may view this verification problem as an instance of 1-sparse recovery and it is possible to apply known 
techniques for this problem in our setting. In particular [3] proposes an approach which uses polynomial evaluation 
modulo a prime p to verify uniqueness for recovered values (from a set of size t,) with success probability (1 — t/p). 
This method maintains sums of the form where S' C [t], at is the weight of item i, and « € Zp is randomly 

chosen. Using this approach for our purposes, in order for Search to maintain correct values with probability (1 — 
over a sequence of n'' edge updates, we must work over p = Adding an edge would require adding (mod p) 

a term ± 2 *, where 1 < i < (j). To the best of our knowledge, this requires performing roughly 21gn multiplications 
of (c -I- d -I- 2) Ign-bit numbers, with reduction modulo p, or alternatively, c -I- d independent parallel repetitions of 
the technique with (c -I- d) primes of size > 41gn bits. In contrast, our approach may be done with (c -I- d)lgn 
multiplications of 2Ign-bit random numbers, with reduction modulo a power of 2. 





to the vector s'-jfj{x) when the edge name is added to Si(x). For each level i, the sums over the 
nodes in a tree of the auxiliary vectors are maintained as well as the sums of the vectors. Let 
^ijb ~ j bi^)• specifics are described below. 

The operations Make_tree_edge(a:, y) and Make_nontree_edge(a:, y) are implemented as in the case 
where cutsets have size one. The remaining operations are implemented as follows: 

• lnsert_edge(a;, y): 

For i = 0,1, 2,... , levelNum — 1 

If h{e) < 2* then 

-{Add e to sample } add its name Xb • yb to value Si{x) and Si{y) and 

For j = 1,..., c' Ign and 6 = 0,1: 

-{Add tag for e:} If hj{e) = b and /j',6(e) = 1 then add 1 to and s[^i^{y). 

• Delete_edge(x, y): 

For z = 0,1,2,.., levelNum — 1: 

If h{e) < 2* then 

-subtract its name Xb ■ yb to value Si{x) and Sj(y) and 

-For j = 1,..., d Ign and 6 = 0,1: 

If hj{e) = b and /j,fe(e) = 1 then subtract 1 from s'and s'jf,(y). 

If the edge is in F, call Make_nontree_edge(x, y) to remove it from the ET-tree containing it. 

• Search(r): 

For i = 0,1, 2,..., levelNum — 1: 

While an edge in the cutset (T, V \ T) has not been found, test Zi as follows; 

Let {z^, z^} be the edge given by the minimum i such that Zi ^ 0. 

If there is no pair j such that z[ ^ q = z[ ^ = 1, then it is an edge in the cutset and returned. 

Else null is returned. 

Let Zi be the sum of the level i vectors for a tree T, and i be the minimum level such that Zi ^ 0. 
Then with constant probability p, Zi is the name of an edge in the cut (T, V \T). If Zi ^ 0 then if 
if Zi is not the name of an edge in G, then with high probability, for some pair j, z'j q = z'j ^ = 1. 

If 7 ^ 0 there is must be at least one edge in the cut, since otherwise h{e) = 0 for all edges 
e. If there is more than one edge in the cut, let e and d be two such edges. Eor 6 = 0,1, let 
S';, = {e I hj{e) = b}. Since each hj is 2-wise independent then for any j, hj{e) / hj{e') with 
probability 1/2, and therefore with probability 1/2, |5o| > 0 and IS*!! > 0. Recall that hj,fjfi and 
fj^i are independently chosen random hash functions. Therefore, given that |5o| > 0 and |5i| > 0, 
zGq = zF^ = 1 with probability (l/2)(l/8)^ = 1/128. The probability that this equality fails 
for all j = 1, 2,..., c(128) Ig re pairs is (1 — 1/128)'^^^^®*®"^ which is less than 1/n^ for sufficiently 
large re. Therefore w.h.p., when Search returns an edge it is an edge in the cut. The probability of 
correctness as stated in Lemma [Q then follows. 

Running time for the cutset data strueture 

Each hash function requires a constant time to perform the hash. 0(log re) hashes are performed 
to determine if an edge should be sampled for level i and for computing its tag. Each node in a tree 
in the forest of a cutset data structure has a vector of O(logre) words, since there are O(logre) levels 
for each level there are two 0(log re) bit vectors, Si{x) and s^{x). Maintaining the sum of vectors of in 
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an ET-tree of 0(log n) vectors of size 0(log n) requires 0(log^ n) time. Similarly, Make_nontree_edge 
or Make_tree_edge requires O(log^n) time. Search{T) involves finding the maximum level i with 
non-zero Zi and checking its tag or determining if zt is on the current edge list and determining the 
name of the trees containing these endpoints all take no more than O(logn) time each. Hence we 
can conclude that the update time for edge insertion or deletion is 0(log^ n) time while the search 
time is O(logn). This concludes the proof of Lemma l2.11 

5 2-edge-connectivity 

In this section, we treat the dynamic 2-edge connectivity algorithm [5] and two instances of the space 
efficient dynamic connectivity data structure presented previously as black boxes, which we denote 
by 2-EDGE, CONNl, and CONN2 respectively. All are generated using independent randomness. 
Using these black boxes, we will present an algorithm to solve dynamic 2-edge connectivity in 
0(n log^ n) words with the same query time and an update time which is within a 0{log^n) factor 
of [5], i.e., 0(log®n) amortized update time and 0(logn/loglogn) query time. The idea is to 
maintain a certificate for 2-edge connectivity using the dynamic connectivity method shown in the 
previous sections, and use that certificate as the input into 2-EDGE. In the algorithm below, We 
give some definitions: 

• G = {V, E): the dynamic graph 

• Fi: A maximal forest of G maintained by CONNl 

• Gi = (U, El): The graph with the forest edges removed. Ei = E \ Fi 

• F 2 : A maximal forest of Gi maintained by CONN2 

• Gc = (U, Ec) The graph which is the input to 2-EDGE. 

• F: The 2-edge connected forest of Gc maintained by 2-EDGE 

We now present algorithms for handling updates and queries: 

Insert (e): 

1. Insert e into E and Ei 
{Process updates to Fi} 

2. For each edge e' added to Fi, delete e' from Ei 

3. For each edge e' removed from Fi insert e' into Ei 
(Process updates to FiU F 2 } 

4. For each edge e! added to Fi Li F 2 , insert e' into Ec 

5. For each edge e' removed from Fi LI F 2 , remove e' from Ec 

Delete edge e = (u, v) is the same as Insert{e) except for the first step which is replaced by “Delete 
e from E and Ei”. 

Query{u,v): Query 2EDGE to see if u and v are 2-edge connected. 

Theorem 5.1. W.h.p., fully dynamic 2-edge connectivity can he maintained in 0(log® n) amortized 
time per update and 0(logn/loglogn) per query using space O(nlog^n) words, over a polynomial 
length sequence of updates. 

A proof is given in the Appendix. 
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Appendix 

Additional proofs 

Lemma 3.5. Invariants (1), (2) and (3) are maintained by Initialize, Insert, and Delete. 

Proof. It is clear that all invariants hold after a call to Initialize. We now consider update operations. 

Maintenance of Invariant (1) holds trivially because Refresh never makes an edge into a tree 
edge on tier 0. 

For Invariant (2), first note that Refresh checks for a cycle and removes an edge from a potential 
cycle before inserting an edge which causes a cycle in a tree, and so each is a forest. For the 
inclusion property Fi C first note that when handling deletions, the same edge is deleted 

from every tier, so this does not violate the invariant. For both insertions and deletions, line [8] of 
Refresh makes e„ a tree edge on all tiers > I, so inclusion is immediately maintained when no cycles 
are created. Now suppose edge e„ would form a cycle if inserted. Let e! be an edge of maximal 
tier f in this cycle. We first note that j > £ J- 1, due to the fact that T{u) is isolated. So by the 
minimality of j and the maximality of the tier of e', it must be the case that e' is not an edge in 
the forest on any tier < j. Therefore, U is removed from all forests containing it, and the inclusion 
property of Invariant 2 is maintained. 

Invariant (3) fails only if there is an isolated fragment T on some tier 1 for which Search (T, I") 
is successful. We call this a bad T. Initially, if we start with an empty graph, there is no bad T. 
We assume there is a first update during which a bad T is created and show a contradiction. A 
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bad T is created either if (1) T is newly formed by the update or (2) T pre-exists the update and 
Search (T,^) is successful in the current graph G. In (2) there are two subcases: T wasn’t isolated 
before the update or T was isolated before the update but Search(T, £) became successful after the 
update. Let {x, y} be the updated edge. We first show: 

Claim: Before the call to Refresh, the only bad fragment introduced on any tier has the form T{x) 
or T{y), and this remains the case during the execution of Refresh. 

Proof of Claim: Before the call to Refresh, there are two ways in which this update could create a 
bad fragment. The first, which may occur for both insertions and deletions, happens when there is 
a fragment T on a tier i for which Search (T,^) did not return an edge before the update, but now 
does. This can only be the case if the cut induced by T has changed, and this is only when exactly 
one of X or y is a node in T. The second can happen before a Refresh and only for deletions, in 
particular when {x,?/} is removed. In this case, two new fragments T{x) and T[y) are created and 
no other cuts are changed. Hence the claim is proved for the bad fragments created before the call 
to Refresh. 

Now, we consider what happens during Refresh. During the execution of Refresh, a bad frag¬ 
ment may result when is made into a tree edge on some tier j > £ on Line El causing two 
fragments T and T' to be joined into one new fragment which may be bad. Since is the result of 
Search(T(x), or Search(T(y), £) for some the new fragment contains x or y, and by Invariant 2, 
so does every tree on higher tiers created by inserting e„. Note that replacing one edge by another 
edge in a cycle does not affect any cut, so the replacement of e' by does not cause the creation 
of a new bad fragment. This concludes the proof of the claim. 

Given the claim, we now show that after calling Refresh, Invariant (3) holds. The proof is by 
induction on the value of ^ in the for-loop of Refresh. Assume ^ < top. Suppose that before iteration 
£, Invariant 3 holds on all tiers < i ov i = 0. This will still be the case after iteration £, since this 
iteration only makes changes on tiers > £. Suppose that T(x) E is bad. This means that T(x) 
is isolated, and Search(T(x), £) returns an edge e^. Since e„ is inserted into T{x) in tier i 

is no longer bad, and Invariant (3) holds on all tiers < The same argument applies to T{y). 
Maintenance of Invariant (3) now holds by induction for all I < top. □ 

Lemma 14.21 follows immediately from the following. 

Lemma 5.2. With probability 1/8, there is an integer j such that exactly one w hashes to a 
value in [2-^]. 

Proof. Let i = ( 2 ). We prove the statement of the lemma for j = i — [lg|IT| — 2]. Then 
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1/(4|VF|) < 2V2^ < 1/{2\W\). 

Pr[3! weW ■. h{w) G [2^']] 

h 

= Pr[/i(t(;) € [2-^] A Vw' G TP \ {t(;} : h{w') 0 [2-^]] 

w£W 


We evaluate the inside of the summation: 

= Pr[/i(u;) G [2^]] Pr[Vu;' G W \ {u;} ; h{w') ^ [2^] \ h{w) G [2^]] 

h h 

> Pr[/i(ui) G [2-^]](l — Pi[h{w') G [2-^] | h{w) G [2-^]]) 

h ^^ h 

w' GW\{w} 

By 2-wise independence: 

= Pr[ft(u,) e [2i|]{l - ^ Pr|/i(«,') £ [2^1]) 

h h 

w' ^W\{w} 

= (2V2^ • (1 - [{\W\ - 1)2^/2^]) 

> (1/(4|W|)(1 - [(|W| - 1)/(2|W|)] > l/(4|W|)(l/2)] = 1/(8|W|) 
ThenPr[3!tc G W : h{w) G [2-^]] > 1/8. 


□ 

Theorem 5.1. W.h.p., fully dynamic 2-edge connectivity can be maintained in 0(log® re) amortized 
time per update and 0(logre/ log log re) per query using space 0(relog^ re) words, over a polynomial 
length sequence of updates. 

Proof. First we prove correctness: Let F be a spanning forest of G and F' be a spanning forest 
of G \ F. Then F U F' is a certificate of 2-edge connectivity, i.e., if G is 2-edge connected then 
so is F U F'. The correctness then depends on whether Fi and F 2 are correctly maintained as the 
spanning forests of G and G \ Fi respectively. By the proof of Theorem EH Fi = Ftop maintained 
by CONNl is the spanning forest of G w.h.p. Since CONN2 uses independent randomness from 
CONNl, the updates to Fi and therefore the updates of the input graph G \ Fi to CONN2 are 
independent of the randomness in CONN2, similarly F 2 = Ftop maintained by CONN2 is a spanning 
forest of G \ Fi w.h.p. By a union bound, both forests are maintained correctly w.h.p. We note 
that 2EDGE is always correct. 

We observe from the proof of Theorem 13.11 that a single update to E results in O(log re) updates 
to Ftop in CONNl, which in turn result in O(log^re) updates to Ec- Each update to Ec has 
amortized time O(log^re) in 2EDGE. Hence the total time per update is 0(log‘^re) worst case time 
in CONNl plus O(log^) worst case time in CONN2 plus Oifog^) amortized time in 2EDGE, for a 
total amortized update time of 0(log®re). As the space used by 2EDGE is 0(rre -|- re log re) words 
and |Fc| < 2re, and the space used by CONNl and CONN2 is O(relog^re) words, the total space 
used is O(relog^re) words. 

□ 
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